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Abstract 

In recent years it has become apparent that a Gaussian center bias can serve 
as an important prior for visual saliency detection, which has been demon¬ 
strated for predicting human eye fixations nHa and salient object detection 
|1]. Tseng et al. have shown that the photographer’s tendency to place inter¬ 
esting objects in the center is a likely cause for the center bias of eye fixations 
IS]. We investigate the infiuence of the photographer’s center bias on salient 
object detection, extending our previous work [H]- We show that the centroid 
locations of salient objects in photographs of Achanta and Liu’s data set w 
in fact correlate strongly with a Gaussian model. This is an important in¬ 
sight, because it provides an empirical motivation and justification for the 
integration of such a center bias in salient object detection algorithms and 
helps to understand why Gaussian models are so effective. To assess the in¬ 
fluence of the center bias on salient object detection, we integrate an explicit 
Gaussian center bias model into two state-of-the-art salient object detection 
algorithms piin]. This way, first, we quantify the infiuence of the Gaussian 
center bias on pixel- and segment-based salient object detection. Second, 
we improve the performance in terms of Fi score, TJg score, area under the 
recall-precision curve, area under the receiver operating characteristic curve, 
and hit-rate on the well-known data set by Achanta and Liu [3 [8]. Third, by 
debiasing Cheng et al.’s region contrast model, we exemplarily demonstrate 
that implicit center biases are partially responsible for the outstanding per¬ 
formance of state-of-the-art algorithms. Last but not least, as a result of 
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(a) Example images 


(b) Centroid scatter plot 
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(c) Example segmentation masks 



(d) Mean segment mask 


Eigure 1: Illustration of the Achanta/Liu data set: example imag es|l(a) the corresponding 
segmentation masks l(c)[ the mean over all segmentation masks [1(d) [ and the scatter plot 
of the centroid locations across all images |r(b)| 


debiasing Cheng et al.’s algorithm, we introduce a non-biased salient object 
detection method, which is of interest for applications in which the image 
data is not likely to have a photographer’s center bias (e.g., image data of 
surveillance cameras or autonomous robots). 
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1. Introduction 

Among other influences such as task-specific factors, human attention is 
attracted to salient stimuli. In this context, saliency describes the subjective, 
perceptual quality that lets some items in the world stand out from their 
neighbors and immediately grab our attention. Accordingly, the goal of visual 
saliency detection is to determine what parts of an image are likely to grab 
the human attention. The task of “traditional” visual saliency detection is 
to predict where human observers look when presented with a scene, which 
can be recorded using eye tracking equipment (e.g., pEnnun]). Liu et 
al. adapted the traditional definition of visual saliency by incorporating the 
high level concept of a salient object into the process of visual attention 
computation [7]. Here, a salient object is defined as being the object in an 
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image that attracts most of the user’s interest such as, for example, the 
man, the cross, the baseball players and the flowers in Fig. 1(a) (left-to- 
right, resp.). Accordingly, Liu et al. [7j defined the task of salient object 
detection as the binary labeling problem of separating the salient object from 
the background. Thus, in contrast to traditional visual saliency detection, 
salient object detection does not just comprise of the task to calculate the 
saliency of image regions, but it also incorporates the task to determine and 
segment the most salient object in the image. Here, it is important to note 
that the selection of a salient object happens consciously by the user whereas 
the gaze trajectories that are recorded using eye trackers are the result of 
mostly unconscious processes. Consequently, also taking into account that 
salient objects attract the human gaze (see, e.g., m) , salient object detection 
and predicting where people look are very closely related yet substantially 
different tasks. 

The photographer’s center bias, i.e. the natural tendency of photogra¬ 
phers to place the objects of interest near the center of their composition in 
order to enhance their focus and size relative to the background (see |5]), has 
been identified as one cause for the often reported center bias in eye-tracking 
data during eye-gaze studies [T314150 As a consequence, the integration of 
a center bias has become an increasingly important aspect in visual saliency 
models that focus on gaze prediction (e.g., M). In contrast, most recently 
proposed salient object detection algorithms do not incorporate an explicit 
model of the photographer’s center bias (see, e.g., IHHIDl US]). A notable 
exception and closely related to our work is the work by Jiang et al. 0 , in 
which one of the three main criteria that characterize a salient object is that 
“it is most probably placed near the center of the image” The authors 
justify this characterization with the “rule of thirds”, which is one of the most 
well-known principles of photographic composition (see, e.g., [II|), and use 
a Gaussian distance metric as a model. We go beyond following the rule 
of third and show that the distribution of the objects’ centroids correlates 
strongly positively with a 2-dimensional Gaussian distribution. This means 
nothing less than that we provide a strong empirical justification for inte¬ 
grating Gaussian center bias models into salient object detection algorithms. 


^Here, it is important to note that Tseng et al. - due to their methodology - did not 
investigate the exact spatial distribution of the objects that attract the gaze. They hired 
five persons who provided subjective scores from 1 to 5 in terms of how interesting things 
were biased toward the image center [5]. 
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To demonstrate the importance, we adapt two state-of-the-art salient object 
detection methods to quantify the influence of the photographer’s center bias 
on salient object detection. 

The contribution of this paper is twofold: First, we use the salient ob¬ 
ject data set by Achanta et al. [H] to investigate the spatial distribution 
of salient objects in images. This way, in Sec. we show that it is likely 
that salient objects in photographs are distributed around the image center 
in such a way that the radii are half-Gaussian distributed and the angles 
are uniformly distributed. Second, in Sec.|^ we explicitly integrate Gaussian 
center bias models in two recently proposed salient object detection methods: 
The pixel-based maximum symmetric surround salient object detection by 
Achanta et al. [lO] and the segment-based region contrast method by Gheng 
et al. in]. In order to measure the influence, we use the following evaluation 
measures: The maximum Fi score, the maximum score with jS = \/0.3 |H], 
the area-under-curve of the precision-recall curve, the AUG of the receiver 
operating characteristic (ROG AUG), and the hit-rate. In summary, the in¬ 
tegration of the center bias model increases the ROG AUG by 2% and the 
performance with respect to all remaining measures by roughly 5%. Thus, 
we further advance the state-of-the-art of pixel-based as well as segment- 
based salient object detection. By modifying Gheng et al.’s region contrast 
model IH], first, we obtained a non-biased salient object detection algorithm 
that is based on region contrast and, second, we exemplarily demonstrate 
that implicit center biases can already be found in well-performing, state- 
of-the-art salient object detection algorithms and substantially influence the 
performance. This is important to consider when comparing and selecting al¬ 
gorithms for applications in which the data is not necessarily biased towards 
the center. 

The remainder of this paper is organized as follows: In Sec. we provide 
an overview of related work. Subsequently, in Sec. we introduce and inves¬ 
tigate our hypotheses about the spatial distribution of salient objects. Then, 
in Sec. we integrate our hypotheses into two recently proposed salient 
object detection methods and evaluate the influence on the salient object 
detection performance. We conclude with a short summary and discussion 
in Sec. Furthermore, please feel free to check the supplemental material 
for additional information such as, e.g., further evaluation results. 
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2. Related Work 


We focus on the most recent related work that addresses bottom-up 
saliency detection with an emphasis on salient object detection (see, e.g., 
jIH] for a more general overview of computational attention models). Such 
methods may be biologically motivated, or purely computational, or involve 
both aspects. In 2009, Achanta et ah mm introduced a salient object de¬ 
tection approach that basically relies on the difference of pixels to the average 
color and intensity value. In order to evaluate their approach, they selected 
a sub-set of 1000 images of the image data set that was collected from the 
web by Liu et al. [7] and calculated segmentation masks of the salient ob¬ 
jects that were marked by 9 participants using (rough) rectangle annotations 
m ■ Please note that this procedure also means that during the manual data 
set annotation the selection of the salient object happens mostly conscious 
whereas gaze trajectories that are recorded using eye trackers are a result of 
a mostly unconscious process. Since it was created, the salient object data 
set by Achanta et al. serves as reference data set to evaluate methods for 
salient object detection (see, e.g., IHHinillS]). Liu et al. and Alexe et al. 
ra approach salient object detection using machine learning. To this end, 
Liu et al. |7] combine multi-scale contrast, center-surround histograms, and 
color spatial-distributions with conditional random fields. Similarly, Alexe 
et al. ng combine multi-scale saliency, color contrast, edge density, and 
superpixels in a Bayesian framework. Closely related to Bayesian surprise 
1231. Klein et al. eg use the Kullback-Leibler Divergence of the center and 
surround image patch histograms to calculate the saliency. Cheng et al. [H] 
use segmentation to define a regional contrast-based method, which simulta¬ 
neously evaluates global contrast differences and spatial coherence. Here, we 
can differentiate between algorithms that rely on segmentation-based (e.g., 
mm) and pixel-based contrast measures (e.g., pm HE]). Closely related 
to our work on the quantitative infiuence of the center bias on salient ob¬ 
ject detection is the work by Jiang et al. P and most recently Borji et al. 
[21] . In Jiang et al.’s work P one of the main criteria that characterize 
a salient object is that “it is most probably placed near the center of the 
image”, which is justified with the “rule of thirds”. Most recently, Borji et 
al. PI] evaluated several salient object detection models and also performed 
tests with an additive Gaussian center bias and conclude that the resulting 
“change in accuracy is not significant and does not alter model rankings”. 
But, this neglects the possibility that well-performing models already have 
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an integrated, implicit center bias, which - as one part of our work - we 
demonstrate exemplarily to be the case for Cheng et al.’s region contrast 
algorithm [Hj. Furthermore, there exist several approaches that explicitly 
integrate a center bias, but do not provide a quantitative evaluation of its 
influence nor an empirical justification of the chosen model (e.g., [22]) • In 
this paper, we adapt the pixel-based method by Achanta et ah ng and the 
segmentation-based method by Cheng et ah [21 to incorporate a model of 
the photographer-related center bias and quantify the influence of the center 
bias on the performance. Furthermore, Borji et al. do not provide an 
empirical justification why a Gaussian distribution is an appropriate center 
bias model, which is another part of the work described in this paper. 

It has been observed in several studies that the visual attention of human 
participants in natural scenes is biased toward the center of static images 
and videos (see, e.g., US [IS 123I). One possible bottom-up cause of the 
bias is intrinsic bottom-up visual saliency as predicted by computational 
saliency models. One possible top-down cause of the center bias is known 
as photographer bias (see, e.g., [T3l - [T5] b which describes the natural ten¬ 
dency of photographers to place objects of interest near the center of their 
composition. In fact, what the photographer considers interesting may also 
be highly bottom-up salient. Additionally, the photographer bias may lead 
to a viewing strategy bias [21|, which means that viewers may orient their 
attention more often toward the center of the scene, because they expect 
salient or interesting objects to be placed there. Thus, since in natural im¬ 
ages and videos the distribution of objects of interest and thus saliency is 
usually biased toward the center, it is often unclear how much the saliency 
actually contributes in guiding attention. It is possible that people look at 
the center for reasons other than saliency, but their gaze happens to fall on 
salient locations. Therefore, this center bias may result in overestimating 
the influence of saliency computed by the model and contaminate the eval¬ 
uation of how visual saliency may guide orienting behavior. Recently, Tseng 
et al. [2] were able to demonstrate quantitatively that center bias is corre¬ 
lated strongly with photographer bias and is influenced by viewing strategy 
at scene onset. Furthermore, e.g., they were able to show that motor bias 
had no effect. However, they did not evaluate and computationally model 
how specifically the objects that attract the gaze are distributed spatially 
in the image. Instead, Tseng et al. hired five naive participants to provide 
subjective scores from 1 to 5 in terms of how interesting things were biased 
toward the image center [5|. In this paper, we use the data set by Achanta 
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Figure 2: Quantile-Quantile (Q-Q) plots of the angles versus a uniform distribution (left), 
radii versus a half-Gaussian distribution (middle), transformed radii (see Sec. |3.3| ) versus 
a normal distribution (right). 


et al. [H] to investigate the distribution of salient objects in photographs and 
then evaluate the influence on two state-of-the-art salient object detection 
models. 

3. Center Bias Model 

To investigate the spatial distribution of salient objects in photographs 
collected from the web, we use the manually annotated segmentation masks 
by Achanta et al. mm that mark the salient objects in 1000 images of 
the salient object data set by Liu et al. [7j. More specifically, we use the 
segmentation masks to determine the centroids of all salient objects in data 
set and analyze the centroids’ spatial distribution. The images in the data 
set by Liu et al. |7] have been collected from a variety of sources, mostly 
from image forums and image search engines. Liu et al. collected more than 
60,000 images and subsequently selected an image subset in which all images 
contain a salient object or a distinctive foreground object [7j. 9 users marked 
the salient objects using (rough) bounding boxes and the salient objects in 
the image database have been defined based on the “majority agreement”. 
However, as a consequence of the selection process, the data set does not 
include images without distinct salient objects. This is an important aspect 
to consider when trying to generalize the results reported on Achanta et al.’s 
and Liu et al.’s data set to other data sets or application areas. 

In order to statistically analyze the 2-dimensional spatial distribution of 
the salient objects’ centroids, we first identify the center of the spatial distri¬ 
bution. Then, given the distribution’s center, we can use a polar coordinate 
system to independently analyze the distribution of the angles and distances 


7 








between the center and the salient objects. 

3.1. The Center 

Our model is based on a polar coordinate system that has its pole at the 
image center. Since the images in Achanta’s data set have varying widths 
and heights, we use in the following normalized Cartesian image coordinates 
in the range [0, 1] x [0, 1]. The mean salient object centroid location is 
[0.5021, 0.5024]"'" and the corresponding covariance matrix is [0.0223, —0.0008; —0.0008, 0.0214]. 
Thus, we can motivate the use of a polar coordinate system that has its pole 
at [0.5, 0.5]^ to represent all locations relative to the expected distribution’s 
mode. 

3.2. The Angles are Distributed Uniformly 

Our hrst model hypothesis is that the centroids’ angles in the specified 
polar coordinate system are uniformly distributed in [—7r,7r]. 

In order to investigate the hypothesis, we use a Quantile-Quantile (Q-Q) 
plot as a graphical method to compare probability distributions (see [25]). In 
Q-Q plots the quantiles of the samples of two distributions are plotted against 
each other. Thus, the more similar the two distributions are, the better the 
points in the Q-Q plot will approximate the line f{x) = x. We calculate the 
Q-Q plot of the salient object location angles in our polar coordinate system 
versus uniformly drawn samples in [—7r,7r], see Fig. (left). The apparent 
linearity of the plotted Q-Q points supports the hypothesis that the angles 
are distributed uniformly. 

We can quantify the observed linearity, see Fig. (left), to analyze the 
correlation between the model distribution and the data samples using prob¬ 
ability plot correlation coefficients (PPCC) [25|. The PPCC is the correlation 
coefficient between the paired quantiles and measures the agreement of the 
fitted distribution with the observed data (i.e., goodness-of-fit). The closer 
the correlation coefficient is to one, the higher the positive correlation and 
the more likely the distributions are shifted and/or scaled versions of each 
other. Furthermore, by comparing against critical values of the PPCC (see 
[26] and [25]), we can use the PPCC as a statistical test, which is closely 
related to the Shapiro-Wilk test IZ71 and can reject the hypothesis that the 
data samples match the assumed model distribution. Furthermore, we can 
use the correlation to test the hypothesis of no correlation by transforming 
the correlation to create a t-statistic. 
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The obvious linearity of the Q-Q plot, see Fig. (left), is reflected by 
a PPCC of 0.998^ which is substantially higher than the critical value of 
0.8880 (see |26j) and thus the hypothesis of identical distributions can not 
be rejected. Furthermore, the hypothesis of no correlation is rejected at 
a = 0.05 (p = 0). 

3.3. The Radii follow a Half-Gaussian Distribution 

Our second model hypothesis is that the radii of the salient object loca¬ 
tions follow a half-Gaussian distribution. We have to consider a half-Gaussian 
distribution in the interval [0,(X)], because the radius - as a length - is by 
definition positive. If we consider the image borders, we could assume a 
two-sided truncated distribution, but we have three reasons to work with a 
one-sided model: The variance of the radii seems sufficiently small, the “true” 
centroid of the salient object may be outside the image borders (i.e., parts of 
the salient object can be truncated by the image borders), and it facilitates 
the use of various, well-known statistical tests (see m)- 

We can use a Q-Q plot against a half-Gaussian distribution to graphically 
assess the hypothesis, see Fig. [^(middle). The linearity of the points suggests 
that the radii are distributed according to a half-Gaussian distribution. The 
visible outliers in the upper-right are caused by less than 30 centroids that 
are highly likely to be disturbed by the image borders. Please be aware of 
the fact that it is not necessary to know the exact distribution parameters 
when working with Q-Q plots as long as the distributions are linearly related 
(see [25]j. Furthermore, we transform the polar coordinates in such a way 
that they represent the same point with a combination of positive angles in 
[0,7r] and radii in [— (X),(X)]. This way, we can compare the distribution of 
the transformed radii against a normal distribution with its mode and mean 
at 0, see Fig. (right). 

The obvious correlation that is visible in the Q-Q plots, see Fig. [^(middle 
and right), is reflected by a PPGG of 0.9987, which is above the critical value 
of 0.9984 (see [2H])- The hypothesis of no correlation is rejected at a = 0.05 

(p = 0). 


^Mean of several runs with N = 1000 uniform randomly selected samples. 
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4. Quantifying the Influence on Salient Object Detection 


To assess the influence of the center bias on pixel- and object-based salient 
object detection, we integrate a Gaussian center bias into the algorithms by 
Achanta et al. [in] and Cheng at al. 

4 . 1 . Center Biased Saliency Models 

4.1.1. Pixel-based 

As a pixel-based model, we use maximum symmetric surround saliency 
detection by Achanta et al. ng in combination with a Gaussian center bias 
map (cf., e.g., [U |3|). To this end, we define the center bias saliency map 


Sc e 


Sc{x,y) = g{fi^-x,fiy-y,a^,ay) with 


( 1 ) 

( 2 ) 



where {x,y) is the pixel coordinate, g — is the image center’s co¬ 

ordinate, and and ay are the standard deviation in x- and y-direction 
depending on the image width and height, respectively. 

In order to investigate the influence of the center bias, we investigate 
different, plausible strategies to investigate the combination of the bottom- 
up and center bias saliency maps Sb and Ac, respectively: 


Sp = f{Sc,SB), 


(3) 


where / is the chosen center bias integration scheme. 

We consider the following schemes, cf. [2H|: First, a convex, linear inte¬ 
gration, i.e. /+(tecAc, Ab) = wcSq+wbSb withtCB+tfc = 1 {wb,wc G Rq)- 
Second, multiplicative integration as a supra-linear combination method, 
i.e. fo{wcSc, Sb) = Ac o Ab, where o denotes the Hadamard product. 
Third, the minimum as a further, alternative supra-linear combination, i.e. 
fiiwcSc, Sb) = min(Ac,AB). Fourth, the maximum to realize a late, sub- 
linear combination scheme, i.e. f^{wcSc, Sb) = max(Ac,AB). All these 
schemes are also related to different Fuzzy logic interpretations, which might 
provide a common theoretical framework and interpretation throughout later 
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Figure 3: An example illustrating the influence of the implicit center bias in the region 
contrast method by Cheng et al. [9]. Left-to-right: Image, region contrast (w/o explicit 
center bias), and locally debiased region contrast (w/o explicit center bias). 


applications (e.g., PH). To improve the readability, we refer to the linear 
combination for explicit center bias integration - unless stated otherwise, of 
course - in the following . 

J1..I.2. Segmentation-based 

As a segmentation-based model, we adapt Cheng et ah’s region contrast 
model [H]. This model is particularly interesting, because it already provides 
state-of-the-art performance, which is partially caused by an implicit center 
bias as we will show in the following. This way, we can observe how the model 
behaves if we remove the implicit center bias - which was neither motivated 
nor explained by the authors - and add an explicit Gaussian center bias. The 
spatially weighted region contrast saliency equation is defined as follows 


-5s(rfc) = Dsirk;ri)w{ri)Dr{rk;ri) with (4) 

Ds{rk\ri)^ eyiY>{-Ds{rk\ri)/al). (5) 

w{ri) is the weight of region r,, which equals the number of pixels in r* - i.e., 
w{ri) = |rj| - to emphasize color contrast to bigger regions. ■) is the 

color distance metric between the two regions 

Dr{ri;r2) = 5/5//(ci;i)/(c2;,)D(Q;c,), (6) 

Cl C2 


where f{ck;i) is the (frequentist) probability of the i-th color Ck-i among all 
Uk colors in the k-th region r^, which is determined using a color histogram. 
The probability of the color inside the regions f{ck-i) is used as weight to 
emphasize color differences between dominant colors. D{ci;Cj) measures the 
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Figure 4; Examples of the influence of the implicit and explicit center bias on segmentation- 
based salient object detection. Left-to-right: Image, region contrast without and with 
center bias (RC and RC-I-CB, resp.), and locally debiased region contrast without and 
with center bias (LDRC and LDRC-I-CB, resp.). 

distance between the colors and in the following it is defined as being the Eu¬ 
clidean distance in the CIE Lab color space. Einally, Ds{rk',Ti) is the spatial 
distance between regions and r*, where a a controls the spatial weighting. 
The spatial distance between two regions is defined as the Euclidean distance 
between the centroids of the respective regions using pixel coordinates that 
are normalized to the range [0, 1] x [0, 1]. Smaller values of ag influence the 
spatial weighting in such a way that the contrast to regions that are farther 
away contributes less to the saliency of the current region. 

It is this unnormalized Gaussian weighted Euclidean distance Da{rk\Ti) 
that causes an implicit Gaussian-like center bias (see Fig. |^and[^, because 
it favors regions whose distances to the other neighbors are smaller, which 
is - in general - the case for segments at the center of the image. Although 
this biased distance function has a significant impact on the performance, 
its choice has not been clearly motivated, discussed, or evaluated by Gheng 
et ah To remove this implicit bias, we introduce a normalized, i.e. locally 
debiased, distance function Ds{rk', a) Ihaf still weights close-by regions higher 
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Figure 5: Illustration of the implicit center bias in the method by Cheng et ah [9]. 
Left: Each pixel shows the distance weight sum, i.e. L)s(r/e; r^), to all other pixels 
in a regular grid. Right: The average weight sum depending on the centroid location 
calculated on the Achanta/Liu data set using Felzenszwalb’s segmentation method m- 


than further away regions, but does not lead to an implicit center bias 

Ds{rk;ri) 


Ds{rk\ri) = 


I.e. 


Vrfc : '^bs{rk\ri) = 1. 


(7) 

( 8 ) 


Similar to the pixel-based model (see Sec. 4.1.1), we can now integrate 
an explicit center bias into the segmentation-based model 


S^Tk) ^ f(Yl bs{rk] ri)w{ri)Dr{rk] n), g{C{rk); <7^, ay) j . (9) 

\rk^ri / 

Here, / is the chosen center bias integration fnnction as in Eq. Further¬ 
more, C{rk) denotes the centroid of region and g is defined as in Eq. 

4 .-2. Quantitative Evaluation 
4-2.1. Dataset 

As for the graphical investigation of our hypotheses using Q-Q plots (see 
Fig. g, we use the manually annotated segmentation masks by Achanta et 
al. [U Hn], see Sec. to quantify the inflnence of the Gaussian center bias 
on salient object detection. 

4 -2.2. Baseline Algorithms 

In order to compare our results, we use a set of saliency detection algo¬ 
rithms that we group into two coarse categories: First, algorithms that were 
specifically proposed for salient object detection and, second, algorithms that 
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have been proposed and evaluated in other contexts. From the second cate¬ 
gory, we use: The well-known saliency model by Itti and Koch EH, Graph- 
Based Visual Saliency (GBVS) by Harel at al. [32], Gontext-Aware Saliency 
(GAS) by Goferman et al. [3SIIS1|, and the FFT’s spectral residuals (FFT) 
and DOT image signatures (DOT) by Hou et al. jSSl ESj- For FFT and 
DOT, we optimized the resolution at which the saliency maps are calculated, 
which is the most important algorithm parameter and has a significant influ¬ 
ence on the performanc^ As baseline for salient object detection algorithms 
(first category), we use: The Frequency-Tuned model (FT) by Achanta et al. 
EQ the Bonn Information-Theoretic Saliency model (BITS) by Klein et al. 
jlBI, the Maximum Symmetric Surround Saliency (MSSS) model by Achanta 
et al. [T0|, and the Region Gontrast (RG) model by Gheng et al. jl] that 
uses Felzenszwalb’s image segmentation method ffl. The latter two are the 
original algorithms we adapted. 

Of course, we evaluate our adapted, center biased models: The maximum 
symmetric surround saliency with center bias (MSSS-I-GB; see Sec. 4.1.1) and 
the region contrast model with explicit center bias (RG+GB; see Sec. 4.1.2). 
In order to investigate the influence of the implicit center bias in the region 
contrast model (see Sec. 4.1.2), we calculate the performance of the locally 
debiased region contrast model without and with explicit center bias (LDRG 
and LDRG+GB, respectively; see Sec. 4.1.2). Additionally, as a reference we 
provide the results for the standalone segment-based and pixel-based center 
bias models, i.e. wc = 1 (CBg and GBp, respectively). 


Implementation notes. If available, we used the reference implementations 
that have been provided by the authors. For MSSS we use the G++ imple¬ 
mentation by Achanta, because it provides a better performance than the 
basic Matlab implementation. For Itti we use the iLab Neuromorphic Vision 
Toolkit (iNVT). We integrated the methods directly into Matlab (mex) in 
order to avoid quantization and/or compression artifacts that may occur due 
to saving and loading them as images. For DGT and FFT, we used the im- 


^We were surprised by the fact that the spectral approaches (i.e., FFT and DCT) 
performed so well, because the previously reported results for FFT stated otherwise (see, 
e.g., IHIISIIISI). However, this can probably be explained by the fact that we analyzed the 
influence of the saliency map resolution on these approaches, which is their most important 
parameter and has a considerable influence on the results. 

^When comparing with the results in [8], please read the erratum that has been pub¬ 
lished at http://ivrg.epf1.ch/supplementary_material/RK_CVPR09 
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Figure 6: Precision-recall curves for all evaluated models with full (top) and limited range 
of the precision (bottom). This graphic is best viewed in color. 


plementations in our publicly available Matlab toolbox EH- All calculations 
have been made using double precision arithmetic. To make our results as 
reproducible as possible (we have observed that the precision-recall curves 
of different authors vary), we will make our implementations and evalua¬ 
tion scripts open source. We would like to note that our evaluation measure 
implementations follow the implementations of Weka and LingPipe. The cor¬ 
responding precision-recall curves and resnlts of further baseline algorithms 
can be seen in Fig. 

4-.2.3. Measures 

We can use the binary segmentation masks for saliency evaluation by 
treating the saliency maps as binary classifiers. At a specific threshold t we 
regard all pixels that have a saliency value above the thresholds as positives 
and all pixels with values below the thresholds as negatives. By sweeping 
over all thresholds min(S') < t < max(5'), we can evaluate the performance 
using common binary classifier evaluation measures. 

Most commonly, precision-recall cnrves are nsed - e.g., by Achanta et al. 
puni, Cheng et al. [1], and Klein et al. [H] - to evaluate the salient object 
detection performance. We use five evaluation measures to quantify the per¬ 
formance of the algorithms. We calculate the area under curve (AUC) of the 
(interpolated) precision-recall curve (PR) and the receiver operating charac¬ 
teristic (ROC) cnrve [38]. Complementary to the PR AUC, we calcnlate the 
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maximum Fi and scores with 

precision • recall 
• precision + recall 

Fp with /3 = '/KS has been proposed by Achanta et ah to weight precision 
more than recall for salient object detection [H]. Additionally, we calculate 
the hit-rate (HR) that measures how often the pixel with the maximum 
saliency belongs to the salient object. 



Ff, = il + P^) 


4-2.4- Results 

Explicit center bias integration type. How does the performance depend on 
the chosen center bias integration? To investigate this question, we tested the 
minimum, maximum, and product as alternative combinations. To account 
for the influence of different value distributions within the normalized value 
range, we also weighted the input of the min and max operation (e.g., A™™ = 
min(wc'S'c, w^Sb))- The results of the algorithms using different combination 
types are shown in Tab. The presented results are the results that we 
achieve with the center bias weight that results in the highest Fi score. 

In Tab. we can see that the linear combination is the best choice for 
LDRC-I-CB. However, for MSSS-I-CB and RC+CB the product seems to be 
the combination that provides the best performance. Apparently MSSS+CB 
benefits more from using the product as combination type than RC+CB. 
Also interesting to note is that LDRC+CB with the product as combination 
achieves similar results to RC. However, LDRC+CB remains the algorithm 
that provides the best performance in terms of Fi score and Fy score whereas 
RC+CB provides the best performance in terms of PR AUC and HR. Inter¬ 
estingly, LDRC+CB and RC+CB achieve a nearly identical ROC AUC. 


Convex center bias weight. How does the weight of the center bias influence 
the performance? To answer this question, we calculated the performance of 
LDRC+CB, RC+RB, and MSSS+CB with wc G [0,1] in 0.025 steps. The 
resulting curves of the Fi score, Fy score, PR AUC, ROC AUC, and hit-rate 
are shown in Fig. 7(a)[ 7(b)| and |7(c)[ respectively. 

For each of the three algorithms the values of wq that lead to the optimal 
Fi score, score, PR AUC, and ROC AUC lie within a small interval. In 
contrast, for all algorithms the value of wc that achieves the highest hit-rate 
is outside these intervals and substantially higher. Furthermore, the best 
weight for each measure depends on the algorithm and varies substantially. 
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Method 

Combination 

Fi 


/PR 

/ROC 

HR 

LDRC+CB 

Linear / Convex 

0.8034 

0.8183 

0.8800 

0.9624 

0.9240 

LDRC+CB 

Max 

0.7504 

0.7561 

0.8108 

0.9422 

0.8630 

LDRC+CB 

Min 

0.7897 

0.8049 

0.8584 

0.9535 

0.8880 

LDRC+CB 

Product 

0.7883 

0.8024 

0.8704 

0.9578 

0.9130 

RC+CB 

Linear/Convex 

0.7973 

0.8120 

0.8833 

0.9620 

0.9340 

RC+CB 

Max 

0.7855 

0.7993 

0.8710 

0.9568 

0.9140 

RC+CB 

Min 

0.7962 

0.8150 

0.8807 

0.9603 

0.9180 

RC+CB 

Product 

0.7974 

0.8136 

0.8878 

0.9623 

0.9460 

MSSS+CB 

Linear/Convex 

0.7490 

0.7678 

0.8265 

0.9495 

0.8900 

MSSS+CB 

Max 

0.7165 

0.7337 

0.7849 

0.9270 

0.8420 

MSSS+CB 

Min 

0.7373 

0.7606 

0.8211 

0.9339 

0.9140 

MSSS+CB 

Product 

0.7523 

0.7748 

0.8398 

0.9445 

0.9350 

LDRC 

- 

0.7574 

0.7675 

0.8302 

0.9430 

0.8680 

RC 

- 

0.7855 

0.7993 

0.8710 

0.9568 

0.9140 

MSSS 

- 

0.7165 

0.7337 

0.7849 

0.9270 

0.8420 

CBs 

- 

0.5793 

0.5764 

0.5920 

0.8623 

0.6980 

CBp 

- 

0.5604 

0.5452 

0.5638 

0.8673 

0.7120 


Table 1: The maximum Fi score, maximum F/j score, PR AUC (/PR), ROC AUC 
(/ROC), and Hit-Rate (HR) that we obtain using different combination types. 


It is interesting to see that small weights only have a minor (yet positive) 
influence on RC+CB until a point is reached (roughly at wc = 0.55) where 
the performance begins to drop significantly. This becomes especially appar¬ 
ent when comparing the curves of RC+CB, see Fig. 7(b)| with the curves of 


LDRC+CB, see Fig. 7(a) 


Quantitative comparison. The center bias itself already has a considerable 
predictive power, see Tab. and is relatively close to the performance of 
FT. However, there is a substantial performance gap between the standalone 
center bias models (CBs and CBp) and good non-biased methods such as, 
e.g., MSSS and LDRC. 


As could be expected, the performance of RC drops substantially if we 
remove the implicit center bias as is done by LDRC (see Sec. 4.1.2), which 
can best be seen in Tab. What happens if we add our explicit center 
bias model to unbiased models? As can be seen in the performance differ¬ 
ence between MSSS and MSSS+CB as well as the performance difference 
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(a) RC (b) MSSS (c) LDRC 


Figure 7: Rlustration of the influence of the weight wq on the performance of RC+CB, 
LDRC+CB, and MSSS+CB (convex combination). 

between LDRC to LDRC+CB, the performance is substantially increased 
with respect to all evaluation measures, see Tab. and Interestingly, 
the relative performance improvement from pixel-based MSSS to MSSS+CB 
and segment-based LDRC to LDRC+CB is comparable, see Tab. Further¬ 
more, with the exception of HR, the performance of LDRC+CB and RC+CB 
is nearly identical with a slight advantage for LDRC+CB (see Tab. and 
Tab. 1^. This indicates that we did not lose important information by de¬ 
biasing the distance metric (LDRC+CB vs RC+CB) and that the explicit 
Gaussian center bias model is advantageous compared to the implicit weight 
bias (LDRC+CB and RC+CB vs RC). 

In summary, MSSS+CB provides a substantially higher performance than 
MSSS and outperforms, e.g., FT and BITS. RC+CB and LDRC+CB pro¬ 
vide a better performance than their unbiased counterparts RC and LDRC, 
respectively. Furthermore, their performance is very similar and both out¬ 
perform all other models. Interestingly, LDRC is the best model without 
center bias in our evaluation on Achanta’s data set. This makes LDRC an 
interesting candidate for applications in which the image data can not be ex¬ 
pected to have a photographer’s center bias (e.g., image data of surveillance 
cameras, autonomous robots, or human-robot interaction H). 

Statistical significance. One question remains: Does the integration of an 
explicit center bias result in a statistically significant performance improve¬ 
ment? To address this question, we test the performance (i.e., Fi, FJ 3 , /PR, 
and /ROC) of LDRC and MSSS with and without an explicit center bias. 
For this purpose, we rely on two pairwise, two-sample t-tests: First, we per¬ 
form a two-tailed test to check whether the compared performances with and 
without an integrated center bias come from distributions with equal means 
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Method 

Fi 


/PR 

/ROC 

HR 

LDRC+CB 

0.8034 

0.8183 

0.8800 

0.9624 

0.9240 

RC+CB 

0.7973 

0.8120 

0.8833 

0.9620 

0.9340 

RC 

0.7855 

0.7993 

0.8710 

0.9568 

0.9140 

MSSS+CB 

0.7490 

0.7678 

0.8265 

0.9495 

0.8900 

LDRC 

0.7574 

0.7675 

0.8302 

0.9430 

0.8680 

BITS 

0.7342 

0.7582 

0.7589 

0.9316 

0.7540 

MSSS 

0.7165 

0.7337 

0.7849 

0.9270 

0.8420 

FFT 

0.6455 

0.6375 

0.6593 

0.8926 

0.8080 

DCT 

0.6472 

0.6368 

0.6612 

0.8962 

0.8270 

GBVS 

0.6403 

0.6242 

0.6970 

0.9088 

0.8480 

FT 

0.5995 

0.6009 

0.6261 

0.8392 

0.7100 

CBs 

0.5793 

0.5764 

0.5920 

0.8623 

0.6980 

CAS 

0.5857 

0.5615 

0.5888 

0.8741 

0.6920 

CBp 

0.5604 

0.5452 

0.5638 

0.8673 

0.7120 

iNVT 

0.3383 

0.4012 

0.4396 

0.5768 

0.6870 


Table 2: The maximum Fi score, maximum Fp score, PR AUC (/PR), ROC AUC 
(/ROC), and Hit-Rate (HR) of the evaluated algorithms (sorted ascending by Fp). 


(i.e., %=■. “means are equal”). Second, we perform a one-tailed test to check 
whether the performance with an integrated center bias is worse that without 
an integrated center bias, i.e. the center biased performance distribution’s 
mode is lower (i.e., 'H<: “mean is lower”). If we can reject both hypothe¬ 
ses, then it is clear that the performance of the algorithm has significantly 
improved due to the integrated center bias. All tests are performed at a 
conhdence level of 95%, i.e., a = 5%. 

For MSSS, we can reject the hypothesis of equal mean for Fi, /PR, 


Method 

Baseline 

Fi 

Fp 

/PR /ROC 

HR 

LDRC 

RC 

96.4 

96.0 

95.3 

98.6 

95.0 

RC+CB 

RC 

101.5 

101.6 

101.4 

100.5 

102.2 

LDRC+CB 

RC 

102.3 

102.4 

101.0 

100.6 

101.1 

LDRC+CB 

LDRC 

106.1 

106.6 

106.0 

102.1 

106.5 

MSSS+CB 

MSSS 

104.5 

104.7 

105.3 

102.4 

105.7 


Table 3: Relative performance (in %) of our adapted algorithms with respect to their 
baseline. 
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and fROC with = 0.0285, pp^ = 0.0031, Pjpr = 5.252 x 10 and 
Pf ROC = 2.618 X 10 respectively. Additionally, we can reject the hy¬ 
pothesis that an integrated center bias has a negative influence on the per¬ 
formance with pp^ = 0.0142, pp^ = 0.0015, P/pr = 2.626 x 10“^, and 
Pf ROC = 1-309 X 10-16. 

Similarly, we can reject the hypothesis that the performance of LDRC 
with and without center bias has an equal mean for Fi, /PR, and 
/ROC with pf^ = 0.0018, pp^ = 2.426 x 10“/ pjpR = 1.118 x 10"/ 
and PfRoc = 1.555 x lO-^, respectively. And, we can reject the hypothesis 
that an integrated center bias has a negative influence on the performance 
with pf^ = 9.071 X 10“^, pp^ = 1.213 x lO”®, Pjrr = 5.590 x 10“®, and 
PjRoc = 7.773 X 10-6. 

Consequently, it is apparent that the integration of a center bias can lead 
to statistically significant performance improvements for pixel-based as well 
as segmentation-based algorithms. 

5. Conclusion 

We formulated and investigated two hypotheses about the location of 
salient objects in photographs: First, the radial centroid distribution around 
the image center is uniform. Second, the distances between their centroids 
and the image center follow a normal distribution. We investigated these 
hypotheses using graphical methods, which indicate that our hypotheses are 
true. This is an important insight, because it provides a strong empirical mo¬ 
tivation and justification for the widely applied Gaussian center bias models. 
To investigate the influence of the center bias on salient object detection, 
we explicitly integrated the center bias model in two state-of-the-art salient 
object detection algorithms. We have shown that the explicitly modeled cen¬ 
ter bias has a significant, positive influence on the performance (in terms of 
hit-rate, the area under the precision-recall curve, the area under the receiver 
operating characteristic curve, the Fi score, and the Fjs score). Last but not 
least, by debiasing Cheng et al.’s region contrast model, we have exemplarily 
shown that implicit center biases might at least partially be responsible for 
the performance of state-of-the-art salient object detection algorithms and as 
a consequence we introduced an adapted, non-biased salient object detection 
algorithm. 
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